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DUPLICATE 



TRAFFIC FLOW OPTIMISATION SYSTEM 

This invention relates to traffic flow optimisation systems. More 
particularly, but not exclusively, it relates to methods of calculating data 
5 traffic flows in a communications network. 

Background , 
Today, large communications networks are serviced by more than 
30,000 Internet Service Providers (ISPs) across the world, predominantly 
operating on a commercial basis as a service provider. The services range 
10 from the mass-marketing of simple access products to service-intensive 
operations that provide specialized service levels to more localized internet 
markets. The present application mainly concerns ISPs providing networks, 
referred to more generally as network service providers. 

With networks playing an ever-increasing role in today's electronic 
15 economy, an efficient management of the networks and an efficient planning 
of future modifications is advantageous. 

With the rapid growth of network usage, network service providers are 
currently facing ever increasing expectations from their customers to the 
quality of service (minimum delay, maximum reliability, high bandwidth for 
20 data transfer rates, low costs, etc). The main task is to satisfy the quality of 
service parameters while maximising the return of investment, i.e. to ensure 
an efficient utilisation of the available bandwidth in addition, but there are 
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issues relating to future sales, strategic planning and business development. 
One concerns Service Level Agreements (SLAs) with their customers. 

An SLA is a service contract between a network service provider and 
a subscriber, guaranteeing a particular service's quality characteristics. SLAs 
5 usually focus on network availability and data-delivery reliability. Violations 
of an SLA by a service provider may result in a prorated service rate for the 
next billing period for the subscriber. 

Thus it is important for a network service provider to know whether it 
can issue an SLA to another customer without violating any existing SLAs. 
10 Here it needs to estimate what is the largest new workload that the network 
can handle with respect to network availability and data-delivery reliability. 

In strategic planning, the objective is to investigate how to revise the 
existing network topology (and connection points to external nodes) such that 
the resource utilization is minimized and more balanced. The problem is to 
15 minimize the resource utilization by determining which backbone links are 
overloaded, and to add links to the network to redistribute the load. For a 
given workload, the question is where to add the links in the network and 
what is the required bandwidth of each backbone link such that the capacity of 
the new network topology is greater, i.e. can deal with a larger load. Another 
20 question concerns which node should be used to connect a new customer to 
the network. This should be the one which minimises resource utilisation and 
expands capacity in the most cost effective way. 
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There are several tools available for planning and optimising 
communications networks. They are usually based on a discrete event 
simulation of the network. For this, a typical scenario of end-to-end loads 
must be provided as input by the user. Starting from this scenario, the tool 
5 simulates the transport of the traffic through the network, observing link 
utilization overload and other events. 

End to end traffic data are usually obtained by obtaining probes or 
router-based information. Such a method is expensive to implement and 
usually only a part of the whole network is equipped with data collection 
10 points. The data collection process has the additional disadvantage that it 
adds to the traffic congestion of the network. 

Such traffic data, if available, may then be used in network modelling 
or a network simulation. However, most simulations are not based on real 
data, but only on estimates. The simulation is then used for network planning 
1 5 or optimisation tools. The user usually defines a scenario which is then tested 
using the simulation tool. 

Another approach is described in the applicant's patent application GB 
0 028 848, filed on 27 November 2000 (agent reference J42831GB), which is 
hereby incorporated by reference. In the approach real link traffic data are 
20 used to derive end-to-end traffic load intervals. Link traffic data are collected 
during a limited time period. Thus a "snapshot" of the data traffic is obtained. 
Data collection may be repeated at certain times or periodically. The results of 
this process are subsequently used in a traffic flow optimisation system to 
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estimate traffic loads in the given scenario. This approach has the 
disadvantage that it is difficult to derive meaningful intervals for the end-to- 
end load in the optimisation process, especially if multiple modifications are 
to be continued to give a consistent estimate in traffic flow. Another 
5 disadvantage of the method is that it is difficult to work from a plurality of 
traffic flow intervals in the optimisation or planning process. 
Summary of the Invention 

It is an aim of the present invention to alleviate some of the 
disadvantages mentioned above and to provide a traffic flow optimisation 
10 system based on measured traffic data which does not require complete end- 
to-end traffic loads. 

It is a further aim of the present invention to calculate traffic values in 
a communications network for a modified scenario using measured traffic 
data of the initial network. 
15 According to one aspect of the present invention, there is provided a 

method of calculating traffic values in a communications network, the 
communications network comprising a plurality of nodes, the nodes being 
connected to one another by links, the method comprising: (a) obtaining 
traffic data measurements through said nodes and/or links in an initial 
20 scenario as input data; (b) deriving a traffic flow model for a modified 
scenario, using a plurality of constraints describing the interdependency of 
said initial to said modified scenario; and (c) calculating values and/or upper 



and lower bounds of traffic values for said modified scenario from said traffic 
flow model using said input data. 

In this way traffic values can be calculated for a modified network (or 
modified scenario) using measured traffic data of the initial, un-modified 
5 network (or scenario). 

By deriving constraints from the interdependency of the initial and. 
modified network, the exact traffic data are used in the calculation for the 
modified scenario if they are not affected by the modification. In this way 
either exact values or relatively tight bounds can be derived for the desired 
1 0 traffic values in a modified network. 

Preferably, the measured traffic data are corrected if inconsistencies 
are detected. In this way more accurate and reliable traffic values can be 
derived. 

According to another aspect of the present invention, there is provided 
15 a method of calculating traffic values in a communications network, the 
communications network comprising a plurality of nodes, the nodes being 
connected to one another by links, the method comprising: (a) obtaining data 
traffic measurements through said nodes and/or links in an initial scenario as 
input data; (b) considering a modified scenario; (c) defining one or more 
20 solution variables for said modified scenario; (d) determining constraints 
between traffic flows through said links and nodes in the initial and the 
modified scenarios; (e) deriving a traffic flow model using said input data and 
said relations for calculating said solution variables. 
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In this way traffic values for modified scenario, like for example in 
forecasting, network planning or resilience analysis, can be calculated using 
measured traffic data of the initial, un-modified network and constraints based 
on the initial network, like network topology and network behaviour of the 
5 initial network. 

Further aspects and advantages of the invention will be appreciated, by 
example only, from the following description and accompanying drawings, 
wherein 

Figure 1 illustrates a simplified example of an Internet Protocol (IP) 
1 0 network in which the present invention can be implemented; 

Figure 2 illustrates the relationship of a network and a network 
management system, into which the present invention may be implemented; 

Figures 3A to 3J illustrate in a simplified way the method used to 
calculate the traffic values in a modified scenario according to the present 
15 invention; 

Figure 4 is a flow chart diagram illustrating the individual steps of a 
traffic flow optimiser according to embodiments of the present invention. 

Figure 5 is a flow chart diagram illustrating individual steps in a traffic 
flow optimiser according to one further embodiment of the present invention. 
20 Figure 1 illustrates a simplified example of an Internet Protocol (IP) 

network. Generally, such a network consists of nodes and links. Nodes in an 
IP network may either be internal or external. An internal node represents a 
location in the network where traffic data is directed through the network. It 
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can be of two types: a device node 2 denoting a network device such as for 
example a router or a network node 4 denoting a local network (e.g. Ethernet, 
FDDI ring). An external node 6 represents a connection to the IP network 
from other networks. 

5 A link is a directed arc between two nodes. Depending upon the 

nature of the two nodes, network links take different denominations. There are 
two main types of links, the backbone and access links. 

A backbone link 24 is a link between two internal nodes; at least one 
of them must be a device node. Indeed, every link in an IP network can 
10 connect either two device nodes or one device node and one network node. A 
connection to a device node is realized via a physical port of the device. If the 
device is a router, then the port is called router-interface 22. A connection to 
a network node is realised via a virtual port of the network. 

An access link is a link between a device node and an external node. 
15 Access links include peering lines 18, uplink lines 20 and customer lines 16. 

A PoP (point of presence) is the set of all devices co-located in the 
same place. 

The link traffic is the volume of data transported through a link and is 
measured in mbps (mega bits per seconds). The bandwidth of a directed link 
20 defines the maximum capacity of traffic that can be transported through this 
link at any one time. 

In this embodiment, we refer to IP-based networks. In an IP network 
the device nodes are represented by routers; we assume that any two nodes are 





directly connected by at most one link, and every external node is directly 
connected to one internal device node. 

A path from a source to a destination node is a sequence of linked 
nodes between the source and destination nodes. A route is a path between 
5 end-to-end internal nodes of a network that follows a given routing protocol. 

A traffic load is the load of data traffic that goes through the network 

i 

between two external nodes independently of the path taken over a given time 
interval. 

A traffic flow between external nodes is the traffic load on one specific 
10 route between these nodes over a time interval. 

A router is an interconnection between network interfaces. It is 
responsible for the packet forwarding between these interfaces. It also 
performs some local network management tasks and participates in the 
operations of the routing protocol (it acts at layer 3, also called the network 
15 protocol layer). The packet forwarding can be defined according to a routing 
algorithm or from a set of static routes pre-defined in the routing table. 

A routing algorithm generates available routes for transporting data 
traffic between any two nodes of a given network. 

In the following, embodiments of the present invention will be 
20 described which are based on the OSPF (Open Shortest Path First) routing 
algorithm for IP-based networks. This algorithm determines optimal routes 
between two network nodes based on the "shortest path" algorithm. The 
metrics to derive the shortest path are fixed for each link, and hence the 



routing cost is computed by a static procedure. The lowest routing cost 
determines the optimal route. If there is more than one optimal path, then all 
optimal paths are solutions (best routes) and the traffic load between the two 
end-nodes is divided equally among all the best routes. 
5 Figure 2 illustrates the relationship of the network and a network 

management system 60 in which the present invention may be implemented. 
Network management system 60 performs the network management functions 
for network 50. The network management system communicates with the 
network using a network management protocol, such as the Simple Network 
10 Management Protocol (SNMP). The network management system 60 
includes a processor 62 and a memory 64 and may comprise a commercially 
available server computer. A computer program performing the calculation of 
data traffic flow is stored is memory 64 and can be executed by processor 62. 
The traffic flow optimisation system may comprise a network planner, 
15 a resilience analyser and/or a forecaster. These elements may either be 
implemented in the network management system or in a connected computer 
system. 

A network planner identifies potential network bottlenecks using the 
traffic flow results, suggests a set of best possible changes to the network 
20 topology and reports on the analysis of the effects of such changes to the 
network and services. Such changes may include topological changes or 
methods of traffic engineering like modification of the routing metrics or the 
implementation of tunnels, i.e. methods of routing traffic in one particular 
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node in different directions. Such a tunnel may for example be used to route 
traffic coming into the node from a first and a second node into one direction, 
and traffic coming into the node from a third node into another direction 

The resilience analyser identifies which of the existing links in the 
5 network will be overloaded in the event that certain links (sets of links) fail. 

The forecaster identifies the impact on the network of rising traffic 
volume. 

Input data for the network optimisation system are network data and 
traffic data. 

10 Network data contains information about the network topology, as for 

example information about the nodes, routers, links, the router interfaces, the 
bandwidth of each link or the parameters of the routing protocol used. Such a 
routing protocol may for example be the OSPF (Open Shortest Path First) 
protocol. Alternatively, other routing protocols like ISIS (Intermediate 

15 System to Intermediate System) and EIGRP (Enhanced Interior Gateway 
Routing Protocol) may be used. In addition, information about the transport 
layer may be used, such as the TCP transport protocol or the UDP transport 
protocol. The list of static routes may also be included. Network data may 
also include information about the end-to-end paths as for example all optimal 

20 routes. 

Traffic data are measured directly from the network considered. 
Suitable network elements have to be selected from which information is 
provided and in which measurements of traffic data are made. The measured 



quantities may for example be the flow between two nodes, the flow between 
two groups of interfaces for two nodes, the traffic entering or leaving a link. 

In the embodiments described, traffic data are provided as snapshots 
of current traffic on the network. Traffic data are collected for a certain time 
5 interval and are collected on a regular basis. According to one embodiment, 
traffic data are collected every 5 minutes and the user may choose the interval 
over which data is collected. A time interval of, for example, 20 to 25 
minutes is suitable. Measurements relating to the average flow rate of traffic 
data passing through every link are taken and stored. The average flow rate of 
10 traffic data is the total traffic volume divided by the time interval. For 
example if a total traffic of 150 Mbit have been counted between Tl and T2, 
the average data is 150/(T2-T1) Mbps. This approach does not show high 
peaks in the traffic. But it is sufficient to determine the load being carried 
through a network. 

15 According to the embodiments described herein, traffic data is 

collected either directly from the network devices or from tables stored in the 
network management system. The network management protocol such as the 
SNMP provides access to the incoming and outgoing traffic at a given time at 
each router and each router interface. 

20 The general concept of the traffic flow optimiser will now be 

explained with help of a simplified model. 
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Figure 3A shows a simplified network including network nodes A to 
G. The snapshot traffic measurement shows a link traffic between node F and 
E of 20, between G and E of 35 and so on. 

Figure 3B shows intervals of the flow variable (i.e. end-to-end flow) 
5 which can be derived from the measured traffic data as is described in patent 
application GB 0 028 848. For example the traffic flow FA can be 
determined to be in the range 0 to 10, then flow GB is between 20 and 35. 

In a resilience analysis we are now interested in the traffic on the 
individual links if one link fails. Figures 3 C and D illustrate a link failure 
10 between nodes E and C. All traffic between nodes E and C has now to be 
re-routed. In the example the traffic goes via node D. 

In the example we are now especially interested in the traffic between 
E and D. The important question in a resilience analysis is whether the link 
considered, here the link between nodes E and D can handle the total amount 
15 of traffic which now, after the link failure, flows over the link. 

Figure 3B illustrates that in the normal case only the traffic from F to 
D and G to D uses the link ED. In case of the link failure all traffic on the 
routes FA, FB, FD, GA, GB and GD goes over this link. 

Thus we want to know what the total volume for all these flows is. 
20 We can derive the line traffic in the modified network (i.e. where the link 
failed) from the original, un-modified network, as is illustrated in Fig. 3E. 

In this case we can use two constraints from the initial network. 
Looking at the traffic between nodes F and E, we know that all traffic FA, FB 
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and FD goes over this link and from the measurement we know that the total 
traffic on this link is 20. Looking at the link between nodes G and E, we 
know that traffic GA, GB and GD goes over the link and that the total traffic 
on this link is 35. Thus we can derive a value for the traffic FA + FB + FD + 
5 GA + GB + GD, which yields 20 + 35 = 55. In this example we can use a 
second constraint, from which we also get a value for FA + FB + FD + GA + 
GB + GD. Looking at the traffic on links EC and ED also yields a value of 
55. 

Thus we can estimate the utilisation of the individual links after link 
10 failure. The values are shown in Figure 3F. 

In a second example we illustrate how we can derive traffic data for a 
planned topology change. We again start from initial network as shown in 
Figure 3A. 

Referring to Figure 3G, the question is now how the traffic data would 
15 be modified if we add a new link between nodes E and B. We can again 
derive the values for the modified network by using constraints derived from 
the initial network. In Figure 3H the resulting utilisation of the links is 
shown. We can derive that the traffic on a new link EB would be 40. 

In a third simplified example we show that we can not always derive 
20 exact value for the traffic in which we are interested, but that we might only 
be able to derive an upper and lower bound for certain traffic values. 

Referring now to Figure 31, the question is how the traffic data would 
look like if we add a new link between nodes G and B. In this example we 




have not enough information or constraints to derive an exact value for the 
traffic on the new link GB. As already shown with reference to Figure 3D, 
we are able to derive an interval for the link traffic between GB, which now 
tells us that the flow is between 20 and 35. With this knowledge and the 
5 measurements of the traffic in the initial network, we can also derive link 
traffic intervals for the links EG, CE and BC, as given in Figure 3 J. 

The traffic flow optimiser in embodiments of the present invention 

i 

uses measured traffic data from an initial network to derive traffic values for a 
modified scenario. The modified scenario might for example be a modified 
10 topology of the network or a modified routing procedure like the introduction 
of primary tunnels. These modified scenarios are for example used for 
network planning and resilience analysis. Other modified scenarios include a 
modified traffic load in the network, like for forecasting traffic. 

The traffic values for the modified scenario are calculated using the 
15 same principles as explained above in the simplified example with reference 
to Figure 3. In the example, the link traffic values for the modified links are 
expressed as a function of the measured traffic data in the initial network to 
derive a value for traffic over a certain link. More generally, relationships 
derived from the initial scenario are used to constrain the traffic values of the 
20 modified scenario. In the embodiments described below, we set up a traffic 
flow model or a constraint model is defined for calculating traffic in the 
modified scenario. Because generally the exact amount of traffic cannot be 
determined, we calculate an upper and lower bound of data traffic. 
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In the constraint model, the flow terms contain typically several 
node-to-node flows. Any link traffic which is not affected by a modification 
corresponds to the link traffic of that link in the initial un-modified network 
and is thus known accurately. 

From the calculated flow data (i.e. values and/or intervals) we derive 
the utilisation of individual links in the modified network. 

In the following we list some of the relationships used in the constraint 
model as examples: 

• the sum of all flows starting in a node in equal to the sum of all 
external traffic entering the node; 

• the sum of all flows ending in a node in equal to the sum of all 
external traffic having the node; 

• the sum of all flow through a link is equal to the total traffic on 
the link. 

Other constraints will be given below, where we describe different 
embodiments of the traffic flow model in more detail. We also refer to the 
description of the constraint model for the traffic flow analysis in then 
applicant's patent application GB 0 028 848, filed on 27 November 2000 
(agent reference J42831GB). The set of constraints described therein can 
alternatively be used in the traffic flow model of the present invention. 

The output variables of the traffic flow optimizer are the flows which 
are affected by the imposed modification. These variables are referred to as 
solution variables in the following. Objective functions are defined using 
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linear programming methods to calculate the bounds for all solution variables. 

The traffic flow optimiser can be used to analyse the results of a whole 
set of modifications. This is for example useful for a resilience analysis of a 
communications network where the service provider might want to ensure 
5 that the network has enough capacities to deal with the failure of any such 
link. 

One possibility to handle such multiple modifications is of course to 
treat each modification individually and to get results for the traffic values for 
each modification individually. However, the traffic flow optimiser according 
10 to one embodiment of the present invention also allows handling a set of 
modifications simultaneously. In this case the optimiser performs the routing 
process for each modification considered (if necessary) . and produces the 
solution variables for each modtflCHticn: 

After all modification scenarios have been considered, the optimiser 
15 produces a list of solution variables, which is then used in the traffic flow 
model. In this list all solution variables are only listed once, so redundancies 
are removed. The set of modifications are then handled simultaneously in the 
traffic flow model. Possible outputs are a set of consistent values for all 
solution variables, which gives a solution to all constraints considered. 
20 Alternatively, or in addition, the modifications can also be handled one by one 
in the model, the output of the TFM is then a set of solution variable values or 
intervals for all modification considered. 




The traffic flow optimiser also allows to handle a plurality of 
measured data sets. In the planning of a network change or in a resilience 
analysis, it is often desired to take into account multiple traffic data sets. In 
this way we do not restrict ourselves to the use of one particular data set 
5 which was collected at one particular time interval in the network considered. 
Instead, multiple data sets collected at different times may be used in one 
analysis. This is important as the traffic may considerably differ at different 
times. The results obtained in this way are thus much more consistent and 
meaningful. In case of a resilience analysis, if an utilisation problem is 
10 detected for the some network failure in all data sets, i.e. to all times 
considered in the analysis, then this points out a much more serious problem 
compared to a situation where the problem occurs in one data set only. 

We analyse the multiple data sets independently and then combine the 
results. In this way we keep all information about correlation between the 
15 different traffic values. However, this method requires running of the traffic 
flow model for each data set separately, which may be very resource- 
consuming, especially if we consider a great number of data sets. 
Alternatively, we combine the multiple data sets into one merged data set and 
run the traffic flow optimizer based on this merged data set. In this case we 
20 loose the correlation between the individual traffic data measurements. This 
has to be taken into account when setting up the constraints for the traffic 
flow model. 
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The measured traffic data may be inconsistent. As such 
inconsistencies affect the TFM, the data are corrected in embodiments of the 
present invention. 

A reason for inconsistencies might for example be that some network 
data is lost (e.g. the collected data from a router is not transported through the 
network). Other reasons might be that the network configuration is wrong 
(e.g. the counter for the size of traffic on a given interface does not represent 
the size of that traffic) or that the network data from different router interfaces 
(in case of a router-router directed link) is not collected at the same time and 
results in slight differences of size. 

The data are corrected using the measured raw data and constraints 
derived from the topology of the network and routing protocols. The 
following constraints may tor example be used for error correction: 

• for each route, the total traffic coming in is equal to the total 
traffic going out; 

• for each link between the interface j of router i and the 
interface 1 of router k, then traffic data on the interface j from 
the router i is equal to the traffic data on the interface 1 directed 
to the router k. 

If the constraints are fulfilled, i.e. if no error is detected, the input 
values are not corrected and the process continues. If an error is detected, 
then the error is minimised using the constraints and a minimisation 
procedure. Error variables are introduced, for example variables specifying 
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the corrected traffic volume for every single node or an overall error 
correction. An objective function is built from the constraining relationships 
using linear programming. 

Output of the correction procedure are the error variables. Using these 
5 variables, the measured data can be corrected such that they are consistent for 
the application of a constraint model. 

The procedure of error correction is similar to those described in 
patent application GB 0 028 848. We refer to this document for further 
details. 

10 Before- the individual steps in a traffic flow optimiser will be set out in 

more detail reference is made to the flow chart diagram of Figure 4. Figure 4 
summarises the steps of a traffic flow optimiser according to one embodiment 
of the present invention. Input to the optimiser is the measured traffic data, 
the topology of the network and the behaviour of the network, such as the 
15 results of performing a routing procedure (steps 102 and 104). The traffic 
data are corrected in step 106 such that a consistent data set is obtained. In 
step 108 a modified scenario is considered. The modified scenario may for 
example include a scenario for planning modified networks (110), for 
analysing resilience of a present or future network (112) or for forecasting 
20 traffic load (114). The topology of the initial and the modified network, if 
appropriate, is used to derive constraints for setting up a traffic flow model 
(1 16). The traffic flow model runs to determine intervals of a set of traffic 
flow solution variables in the modified network and/or one consistent solution 
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for the set of modifications considered. In step 118 the output of TFM, i.e. 

the solution variable intervals, are analysed. The traffic flow optimiser may 

for example be used for selecting a suitable modification or set of 

modifications from the traffic flow intervals calculated for the different 
5 modifications considered. For the resilience analysis, the solution analyser 

may for example transform the solution variables into utilisation values. 

The principle set out above can then be used in optimisation 

procedures like for example forecasting, resilience analysis, or network 

planning. 
10 Forecasting 

For forecasting traffic values in a given communications network, the 
network topology is not changed. However, the traffic load of the network is 
modified. Two different types of modification can be implemented. The first 
modification is a growth of the different end-to-end flows proportional to their 
15 original traffic. The second modification allows for additional traffic which 
can be specified to flow between certain nodes. However, the additional 
traffic should be allocated to nodes which have enough capacities for 
extending the traffic load (see the description for network planning below). 

The traffic flow model is then set up such that for each node to node 
20 flow the correct growth factor is used and any additional flows are added. 
Output of the forecasting process are then modified traffic values (generally 
intervals of the link traffic) for which we can forecast the utilisation of the 
network if the traffic load growths. The constraints can be aggregated for 
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groups of flows, e.g. for all flows between two PoPs. Often, we will obtain 
more accurate results this way. 
Resilience analysis 

In order to study resilience of the network, the communications 

5 network is modified. Individual or multiple network elements, such as links 
between network nodes are removed in the analysis and the impact of such 
"failures" on the network are studied. Node failures are simulated by 
removing all links connecting to the corresponding node. For simulating a 
router node failure, the links inside a PoP and also the links connecting to 

10 another PoP can be removed. We can also treat any combination of link 
failures as a single failure cases, for example to analyze the effect of a fibre 
cut. 

After the network modification, the routing procedure is performed in 
the modified network. The solution variables to be determined are defined. 
15 They correspond to the link traffic values in the modified network. 
Subsequently the TFM is set up using the constraints derived from the 
network topology and behaviour and the measured and corrected traffic data. 
As a result of the TFM upper and lower bounds are calculated for the desired 
solution variables and the utilisation of the derived links can be calculated. 
20 The optimiser further returns the amount of any un-routed traffic. 

In such a resilience analysis a whole set of modifications to the 
network can be studied as described above. The information of the individual' 
modification can be given as a list of network modifications, and the traffic 
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flow optimiser runs through all given modifications automatically, giving a 
list of the desired network element utilisation and the un-routed traffic for 
each modification. 
Network Planning 

5 With the network planner the impact of new network elements, such 

as new links, can be studied. 

The user can either specify to which node and/or interface of a node 
the new links should be connected. Alternatively* the user can specify one 
node of each PoP as the extension point, and the tool automatically selects the 

1 0 best point of connection for the new links. 

Similar to the resilience analysis, routing is performed in the modified 
network, the TFM is set up and intervals are calculated for the desired 
solution variables. Again a list of selected modifications can be studied, 
either by handling each modification individually, or by combining multiple 

15 modifications. 

In other embodiments, more specific queries can be answered by the 
traffic flow planner, the forecaster or the resilience analyser (110, 112 or 114 
of Figure 4). In this case the solution analyser 118 (or part of it) is 
incorporated into the traffic flow planner, the forecaster or the resilience 

20 analyser. Each of elements 110, 112, 114 can for example define a specific 
set of queries which is then calculated in the TFM using the complete set of 
constraints as described above. 
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In this case the step of calculating variables the utilisation of 
individual links are performed in the optimiser elements 1 10,1 12 or 1 14. The 
solution variables are then no longer link traffic values, but variables like for 
example the utilisation of a link (in percent), the overall utilisation of the 

5 modified network (in percent), the maximal utilisation of any link in the 
network, the overloading of a link (in percent), the total volume of overloaded 
traffic or the volume of traffic in the original network that cannot be routed in 
the modified network. 

As a consequence of using these variables the constraints used in the 

10 TFM are no longer linear. Such constraints are for example that the amount 
of overload for a link is the percentage of utilisation exceeding the utilisation 
limit or that the maximal utilisation is the maximum of all link utilisations in 
the network. We use linear approximation for these non-linear constraints. 

The forecasting element 112 or planning element 110 may for 

15 example directly answer the following questions: 

• What is the utilisation of each link? 

• What is the overall utilisation of the network? 

• Are there any links that are overloaded? 

The variables derived in such a case are the minimal and maximal 
20 utilisation of any link in the network (in percent), the overall utilisation of the 
network and the total amount of overloading in the network. 

The queries of the resilience analyses may be specified as for example: 
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• For each change, what is the utilisation of each link and which 
flows are lost, i.e. no longer transported in the network? 

• For each change, are there overloaded links and which flows 
are lost? 

5 In a resilient planner, the query may for example be formulated as: 

• For each change, is the resulting network resilient and what is • 
the utilisation of each link? 

The traffic flow optimiser is also useful in traffic engineering, for 
example for obtaining information about possible placements of primary 
10 tunnels. 

Referring now to Figure 5, we describe how a traffic flow optimiser 
handles multiple network modifications like for example in a resilience 
analysis. In this embodiment, the traffic flow optimiser includes a 
modification generator 212. The generator 212 is implemented in software. 

15 It automatically generates a set of proposed modifications, like for example 
deleting every link in a network or every link in a particular set of links in a 
resilience analysis. Alternatively, the user may specify a set of modifications 
for a particular analysis. The set of modifications are then communicated to 
the planning or resilience element via an application programming interface 

20 (API). The optimiser element then considers a first modification of the 
network (step 214) as requested and routing is performed on the modified 
network in step 216. In step 218 the optimiser element performs flow 
aggregation and the required flow terms or solution variables are defined. In 
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step 220 it is checked whether more modifications are to be carried out. If the 
answer is yes, then steps 214 to 220 are repeated, until no further 
modifications are required in step 220. If the answer is no, then the system 

continues with step 222. 
5 In step 222 the optimiser element merges the required flow terms into 

a list. In this process any duplicates of flow terms are removed, such that a 
list of unique flow terms is provided to the TFM (step 224). In addition to 
this list, the TFM is also provided with information of the network topology 
and behaviour of the initial network and the measured traffic data (steps 202 
10 to 206). Now the traffic flow model is set up by deriving constraints and 
building objection functions as described above. The output of the TFM is an 
upper and a lower bound for each of the solution queries. Alternatively, or in 
addition, one solution for all the solution variables can be calculated, which 
simultaneously satisfies all given constraints. 
1 5 The resulting intervals or values are transformed into utilisation values 

of the links and/or values of the modified network in step 226. From these 
data the user or the system may select appropriate network modifications (step 
228). If the TFM calculates directly solution variables like the utilisation 
variables and overflow values, the modification selector uses the values or 
20 intervals provided from the TFM. The modification generator of step 212 
provides the necessary information for the selection of suitable or test 
modifications. If the traffic flow optimiser is used for forecasting traffic the 
definition of the modified scenario is given in step 230. As the network itself 
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is not modified in this case, steps 214 and 216 are not performed and the 
modified traffic load is directly communicated to the optimiser in step 218. 
The modified traffic load is given as a percentage of the current traffic load. 

In the following we describe different embodiments of the constraint 
model for the traffic flow optimiser in more detail. 

As a first example, we give an example of a simple constraint model. 
In this simplified model, network nodes and routers are treated in the same 
way. External connections are not modelling explicitly. Only the total traffic 
volume into and out of a node to all external interfaces is taken into account. 

In a second example, we give a further, more complete, example of a 
constraint model. We refer to this example as the full constraint model. In this 
case we distinguish between network nodes and routers. Node interfaces can 
be classified as interfaces of different groups if they belong to different 
customers. 

In a third example, the full constraint model described in the second 
example is presented including error correction. 

Subsequently the constraint model is described for the traffic flow 
optimiser according to one embodiment of the present invention. This 
description is based on the third example, i.e. the full constraint model 
including . error correction. We present descriptions of the variables, 
constraints, approximations to non-linear constraints and solution variables 
used in this embodiment. Specific queries for a forecaster, a resilience 
analysis, a planner and also a resilience planner are given. 
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In a further embodiment of the present invention, we describe a traffic 
flow model, which is especially useful for large networks. In this model we 
introduce additional approximations. As a result, the overall number of 
variables to be calculated is reduced. The remaining variables are 
approximated in this model. 
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Simple Constraint Model 



3.1 Constants 

We start by defining the constants that we use to describe the network, its 
routing and the traffic loads in the network. 

Definition 3.1 Nodes is the set of all nodes in the network. The number 
of nodes is denoted by n. Indices i, j, k, k x , I and l x refer to nodes. 

Definition 3.2 Pops is the set of all PoPs in the network. The number of 
PoPs is denoted by r. Indices p and q refer to PoPs. Each PoP is a set of 
nodes, and each node belongs to exactly one PoP. 

Definition 3.3 The function pop : i M- p maps the node index i to a PoP 
index p. 

Definition 3.4 Lines is the set of all node pairs kl so that a directed line 
exists between nodes k and I in the network. The number of lines in the 
network is denoted by m. • 

Definition 3.5 The constant t u is the consistent traffic volume on the di- 
rected line between from node k to node I. 

Definition 3.6 The constant cj n is the sum of all consistent traffic volumes 
for external traffic into node i. 

Definition 3.7 The constant c? ut is the sum of all consistent traffic volumes 
for external traffic out of node i. 

Definition 3.8 The constant R kl is the set of all node pairs ij so that the 
flow from node i to node j is (partially) routed through line kl. 

Definition 3.9 The constant rH 1 is a number between 0 and 1 which de- 
scribes which fraction of the flow from node i to node j is routed through line 
kl. A value 0 indicates that the flow is not routed through line kl. 

Definition 3.10 The constant pg q is either 0 or 1 and indicates whether 
the flow from node i to node j is a flow from PoP p to PoP q,i.e.i€p and 
j 6 q. 
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Lemma 3.1 We have that 



V i,je Nodes: £ p?/ = 1 



pePopa 



holds for all nodes i and j. 



3.2 Variables 

We now describe the variables of our model. The first set, the flow variables, 
are used to define flows in the network consistently. It is the second set, the 
solution variables, that we are really interested in. These solution variables 
are defined as sums of flow variables. 

Definition 3.11 The non-negative variable fy describes the flow from node 
i to node j. These variables are called flow variables. 

There are n 2 variables of this type. 

Definition 3.12 The non-negative variable s pq describes the flow from PoP 
p to PoP q. These variables are called solution variables. 

There are r 2 variables of this type. 
3.3 Constraints 

We now list the constraints used in our minimal model. The names of the 
constraints are taken from the "RiskWise II Constraint Model" document. 

Constraint 3.1 (flow_external(2)) The constraint states that the sum of 
all flows starting in a node is equal to the sum of all external traffic entering 

the node. ^ 

V ie Nodes: £ fij = cj n 

j 6 Nodes 

We have n constraints of this type. 

Constraint 3.2 (flow.external(l)) The constraint states that the sum of 
all flows ending in a node is equal to the sum of all external traffic leaving 
the node. ' 

V je Nodes: £ A* * °? 

ie Nodes 



We have n constraints of this type. 
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Constraint 3.3 (traffic-Contribution_equal) The constraint states that 
the sum of all flows through a line is equal to the traffic on the line. 

V kl € Lines : ]jT rg * fij = t k i 
We have m constraints of this type. 

Constraint 3.4 (solution_term(l)) The constraint states that the flow be- 
tween two PoPs is equal to the sum of all flows between nodes which belong 
to the first and second PoP. 



V p,q € Pops : s pq = 
We have r 2 constraints of this type. 



j eg 



3.4 Matrix 



The matrix form of the problem is defined by the set of equations 
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x This section can be skipped on a first reading. 
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The matrix M is defined as 
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The constraints are arranged in the sequence 

1. flow_external(2) 

2. flow_external(l) 

3. trafficcontribution^equal 

4. solution-term(l) 

The matrix has 2n + m + r 2 lines and n 2 + r 2 columns. 

Remember that only one in each column has the value 1, all others 
are 0. This means that 100/r 2 (CWG: 11) percent of the p?/ values are non 
zero. 

Experiments with the CWG network have shown that most lines carry at 
least n flows, with some carrying 1000 flows. This means that roughly 5-10% 
of the r§ values are non-zero. 

To highlight the structure of the matrix, we can also show it as the com- 
bination of multiple smaller matrices. 
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with the smaller matrices 



ONE x = 
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where the line of ones is line x. The size of this matrix is n x n. 
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The size of this matrix ismxn. 
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\P7l pS 
The size of this matrix is r 2 x n. 

Lemma 3.2 We /ind that P x = F y if pop(x) = pop(y). 
3.5 Objective Functions 

As result of the flow analysis we want to have lower and upper bounds for 
all solution variables. This implies the objective functions 



and 
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max s pq 
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which are run independently for all pairs of PoPs p and q. This means that 
we have to run 2 * r 2 optimization queries. We can reduce this value by 
checking if a particular solution variable s pq has a value 0 in one of the other 
optimization runs. If that happens, we do not have to check it for minimum 
value. 



» 
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Full Constraint Model 

The main difference between the simple constraint model as set out above and the 
full constraint model as described in the following is that we distinguish between 
the different types of external interfaces. We also allow a separation of customers 
interfaces into groups 1 and 2. If the constraint model is used to perform an 
analysis of the traffic between PoPs, group 1 will contain all customer interfaces 
of a node and group 2 will be empty. For other analyses, some interfaces belong 
to group 1 and others to group 2. 



4.1 Constants 

We start by defining the constants that we use to describe the network, its 
routing and the traffic loads in the network. 

Definition 4.1 Nodes is the set of all nodes in the network. The number 
of nodes is denoted by n. Indices i, j, k, k x , I and l x refer to nodes. 

Definition 4.2 Routers is the set of all routers in the network. The num- 
ber of nodes is denoted by n r . 

Definition 4.3 Nets is the set of all network nodes in the network. The 
number of nodes is denoted by n n . 

Lemma 4.1 We have 



Definition 4.4 Pops is the set of all PoPs in the network. The number of 
PoPs is denoted by r. Indices p and q refer to PoPs. Each PoP is a set of 
nodes, and each node belongs to exactly on PoP. 



Nodes = Routers U Nets 



and 



Routers fl Nets = 0 



Definition 4.5 The function pop : i »-» p maps the node index i to a PoP 
index p. 
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Definition 4-6 Lines is the set of all node pairs kl so that a directed line 
exists between nodes k and I in the network. The number of lines in the 
network is denoted by m. 

Definition 4.7 The value o denotes the number of interconnection lines in 
the network. 

Definition 4.8 The constant h\ denotes the number of interface groups 
found at node i. The value hi is equal to the number of interconnection 
lines attached to node i plus 2. 

Lemma 4.2 The total number of groups in the network is 

]T h * = 2n + ° 

i€ Nodes 

Definition 4.9 The constant tw is the consistent traffic volume on the di- 
rected line from node k to node I. 

Definition 4.10 The constant cj n is the sum of all consistent traffic volumes 
for external traffic into node i. 

Definition 4.11 The constant c|£ is the sum of all consistent traffic volumes 
for external traffic which belongs to group a into node i. 

Definition 4.12 The constant cf ut is the sum of all consistent traffic vol- 
umes for external traffic out of node i. 

Definition 4.13 The constant cf£* is the sum of all consistent traffic vol- 
umes for external traffic which belongs to group a out of node L 

Lemma 4.3 We have that 

V ie Nodes: c j n = £ c£ 

and 

V ie Nodes: C = £ c £* 

Definition 4.14 The constant R k i is the set of all node pairs ij so that the 
flow from node i to node j is (partially) routed through line kl 
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Definition 4,15 The constant r-j 1 is a number between 0 and 1 which de- 
scribes which fraction of the flow between nodes i and j is routed through line 
kl. A value 0 indicates that the flow is not routed through line kl. 

Definition 4.16 The constant p{j q is either 0 or 1 and indicates whether 
the flow between nodes i and j is a flow between PoPs p and q, i.e. i 6 p and 
3 € g. 

Lemma 4.4 We have that 

V i,j € Nodes: £ tfj = 1 

p€ Pops 
q 6 Pops 

holds for all nodes i and j. 
4.2 Variables 

We now describe the variables of our model. The first set, the flow variables, 
are used to define flows in the network consistently. The second set splits the 
flows in even more details, so that we can attach, more detailed constraints. 
But it is the last set, the solution variables, that we are really interested in. 
These solution variables are defined as sums of flow variables. 

Definition 4-17 The non-negative variable fy describes the flow from node 
i to node j: These variables are called flow variables. 

There are n 2 variables of this type. 

Definition 4.18 The non-negative variable gg b describes the flow from group 
a of node i to group b of node j. These variables are called flow group vari- 
ables. 

Lemma 4.5 There are 4n 2 + 4on + o 2 variables gff. 

Definition 4.19 The non-negative variable s pq describes the flow from PoP 
p to PoP q. These variables are called solution variables. 

There are r 2 variables of this type. 
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4-3 Constraints 

We now list the constraints used in our model. The nam^s of the constraints 
are taken from the "RiskWise II Constraint Model" document. 

Constraint 4.1 (flow_external(2)) The constraint states that the sum of 
all flows starting in a node is equal to the sum of all external traffic entering 
the node, 

V i€ Nodes: £ fa = c? 

je Nodes ' 

We have n constraints of this type. 

Constraint 4.2 (flow_external(l)) The constraint states that the sum of 
all flows ending in a node is equal to the sum of all external traffic leaving 
the node. 

V j€ Nodes: £ fa = cf* 

ie Nodes 

We have n constraints of this type. 

Constraint 4.3 (traffic-contribution-equal) The constraint states that 
the sum of all flows through a line is equal to the traffic on the line. 

V kle Lines : ^2 r ij * = tkl 
We have m constraints of this type. 

Constraint 4.4 (solution_term(l)) The constraint states that the flow be- 
tween two PoPs is equal to the sum of all flows between nodes which belong 
to the first and second PoP. 

V p, q € Pops : s pg — ^2 /« 



We have r 2 constraints of this type. 

Constraint 4.5 (flowXgJum) The total flow between two nodes is equal 
to the sum of the flows between the interface groups at the nodes. 

V i e Nodes J e Nodes : fa- 9ij 

\<b<hj 




There are n 2 constraints of this type. 

Constraint 4.6 (flow_gJimit a) The sum of all flows from an interface 
group at a node is equal to the consistent traffic into this group. 

V ieNodes,l<a<hi: c£ = £ g$ 

j erodes 
\<b<hj 

There are 2n + o constraints of this type. 

Constraint 4.7 (flow-gJimit b) The sum of all flows to some interface 
group at a node is equal to the consistent traffic out of this group. 

V j eNodes,l<b<h r . c% L = ]T $ 

i£ Nodes 
l<a<hi 

There are 2n + o constraints of this type. 

Constraint 4.8 (empty Jlow) There are no flows between interconnection 
lines. 

V ie Nodes, j G Nodes, 3 < a < h { ,3 < b < hj : g$ = 0 

There are o 2 constraints of this type. These constraints axe solved by pre- 
processing, setting some variable to zero. 

Constraint 4.9 (inverse^g Jlow) The flow between nodes i andj is limited 
by the flow in the inverse direction. 

V i e Nodes, j e Nodes, 1 < a < h u 1 < 6 < h s : g$ < L * g *f 

There are 2n + o constraints of this form. 

Constraint 4.10 (self _flow_net) There is no flow within a network node. 

V i e Nets : /« = 0 

There are n„ constraints of this form. These constraints are solved by pre- 
processing, setting some variable to zero. 

Lemma 4.6 Constraint 4-1 and 4-2 are implied by the constraints 4.5, 4.6 
and 4.1 and are therefore redundant. 
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Proof: We show that constraint 4.1 is implied by adding up all equations 4.6 
for a node i: 

l<a<hi 

Constraint4.6 v— ^ nb 



E E «tf 



Ka<ht i€iVod« 
— - *<*><hj 
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4.4 Matrix 

2 The matrix form of the problem is defined by the set of equations and 
inequalities 
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2 This section can be skipped on a first reading. 
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The last 2n 4- o lines axe inequalities, the rest are equations. The constraints 
are arranged in the sequence 

1. flow_external(2) 

2. flow.external(l) 

3. traffic-contribution-equal 

4. solution_term(l) 

5. flow_f_g_sum 

6. flow_g_limit a 

7. flow_gJimit b 

8. inverse_g_flow 

The matrix M is defined as 



/ ONEi ONE 2 • • • ONE n 0 0 \ 

/ I ..- I 0 0 

Rr R 2 ••' Rn 0 0 

Fx Pi Pn -I 0 



-/ OH 
0 G 
0 G 




The matrix has 2n + m + r 2 + n 2 + 3(2n + o) lines and n 2 + r 2 + (2n + o) 2 
columns. We now describe the different sub parts of the matrix. 



/ 0 0 ••• 0 \ 



ONE x = 



0 0 

1 1 
0 0 



0 
1 
0 



V o o ... .o y 



where the line of ones is line x. The size of this matrix is n x n. 
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0 1 ••• 0 
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\ 0 0 



1 / 



42 



Rx — 


( rft 1 ' 
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r k m l m 


The size of this matrix ismxn. 
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The size of this matrix is r 2 x n. 

Lemma 4.7 We find that P x = P y ifpop(x) = pop(y) 
The sub matrix H has the form 
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in 



if = 



IT 



V o 

with n 2 lines and (2n + o) 2 columns. 



Tu y 



Definition 4.20 Let g x : i x a i-> x be the bijection which maps tuples ix a 
to an index x. 

Let g y :ixjxaxbi-4y be the bijection which maps tuples i x j x ax b 
to an index y. 

Let g~ l and g~ l be their inverses. 

The matrix G consists of elements g xy which are defined by the formula 

l (x) = i x a and g y l (y) — ixjxaxb 



9xy 



= f 1 if - — *y 

\ 0 otherwise 



The matrix G consists of elements g xy which are defined by the formula 

- f i if 9ZK 

3xv \ 0 otherwis 



= j x b and = i x j x a x b 

otherwise 
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Finally, the matrix V consists of values 0, 1 and — L. Each line and column 
contains one entry with value 1 and one entry with value -L. The values 1 
are on the main diagonal. Formally, the matrix entries are defined as 

'1 if x = y and g~ x {x) ^ixixaxa 
—L if x ^ y and 
"xy=< 9y 1 {x) = i*3 xax& and 

Pv 1 (y) = i>< ix6xa 

0 otherwise 
4.5 Objective Functions 

As result of the flow analysis we want to have lower and upper bounds for 
all solution variables. This implies the objective functions 

min s pg 

and 

max Sp, 

which are run independently for all pairs of PoPs p and q. This means that 
we have to run 2 * r 2 optimization queries. We can reduce this value by 
checking if a particular solution variable Sp, has a value 0 in one of the other 
optimization runs. If that happens, we do not have to check it for minimum 
value. 
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FULL CONSTRAINT MODEL INCLUDING ERROR 
CORRECTION 



We now present the full constraint model including error correction. 



5.1 Constants 

We start by defining the constants that we use to describe the network, its 
routing and the traffic loads in the network. 

Definition 5.1 Nodes is the set of all nodes in the network. The number 
of nodes is denoted by n. Indices i, j, k, k Xs I and l x refer to nodes. 

Definition 5.2 Routers is the set of all routers in the network. The num- 
ber of nodes is denoted by n r . 

Definition 5.3 Nets is the set of all network nodes in the network. The 
number of nodes is denoted by n n . . 

Lemma 5.1 We have 



Definition 5.4 Pops is the set of all PoPs in the network. The number of 
PoPs is denoted by r. Indices p and q refer to PoPs. Bach PoP is a set of 
nodes, and each node belongs to exactly on PoP. 

Definition 5.5 The function pop : i p maps the node index i to a PoP 
index p. 

Definition 5.6 Lines is the set of all node pairs kl so that a directed line 
exists between nodes k and I in the network. The number of lines in the 
network is denoted by m. 



Nodes = Routers U Nets 



and 



Routers D Nets = 0 



Definition 5.7 The value o denotes the number of interconnection lines in 
the network. 



45 



Definition 5-8 The constant h 5 denotes the number of interface groups 
found at node i. The value hi is equal to the number of interconnection 
lines attached to node i plus 2. 

Lemma 5,2 The total number of groups in the network is 

52 hi = 2n 4- o 

ieNodes 

Definition 5.9 The constant ufS urce is the observed traffic volume entering 
the directed line from node k to node I 

Definition 5.10 The constant u£f 8t is the observed traffic volume leaving 
the directed line from node k to node I. 

Definition 5.11 The constant a£ is the sum of all observed traffic volumes 
for external traffic which belongs to group a into node i. 

Definition 5.12 The constant ag* is the sum of all observed traffic volumes 
for external traffic which belongs to group a out of node i. 

Definition 5.13 The constant R kl is the set of all node pairs ij so that the 
flow from node i to node j is (partially) routed through line kL 

Definition 5.14 The constant rjj 1 is a number between 0 and 1 which de- 
scribes which fraction of the flow between nodes i and j is routed through line 
kL A value 0 indicates that the flow is not routed through line kL 

Definition 5.15 The constant pg q is either 0 or 1 and indicates whether 
the flow between nodes i and j is a flow between PoPs p and q, i.e. i €p and 

j 6 <l- 

Lemma 5.3 We have that 

V ije Nodes: £ p?? = 1 

q 6 Pops 



holds for all nodes i and j. 
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5.2 Variables 

We now describe the variables of our model. The first set, the flow variables, 
are used to define flows in the network consistently. The second set splits the 
flows in even more details, so that we can attach more detailed constraints. 
But it is the last set, the solution variables, that we are really interested in. 
These solution variables are simply defined as sums of flow variables. 

Definition 5-16 The variable vgj urce is the error correction applied to the 
observed traffic volume entering the directed line from node k to node 1. This 
variable may be negative. 

There are m variables of this type. 

Definition 5-17 The variable v£f st is the error correction applied to the 
observed traffic volume leaving the directed line between nodes i and j. This 
variable may be negative. 

There are m variables of this type. 

Definition 5-18 The non-negative variable tki is the consistent traffic vol- 
ume on the directed line between nodes i and j. 

There are m variables of this type. 

Definition 5.19 The non-negative variable cj" is the sum of all consistent 
traffic volumes for external traffic which belongs to group a into node %. 

There are 2n + o variables of this type. 

Definition 5.20 The non-negative variable cf^ is the sum of all consistent 
traffic volumes for external traffic which belongs to group a out of node i. 

There are 2n 4- o variables of this type. 

Definition 5.21 The variable e£ is the error correction applied to the sum 
of all consistent traffic volumes for external traffic which belongs to group a 
into node i. This variable may be negative. 

There are 2n + o variables of this type. 

Definition 5.22 The variable ef a ut is the error correction applied to the sum 
of all consistent traffic volumes for external traffic which belongs to group a 
out of node i. This variable may be negative. 
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There are 2n + o variables of this type. 

Definition 5.23 The non-negative variable fy describes the flow from node 
i to node j. These variables are called flow variables. 

There are n 2 variables of this type. 

Definition 5.24 The non-negative variable gg b describes the flow from group 
a of node i to group b of node j. These variables are called flow group vari- 
ables. 

Lemma 5.4 There are 4n 2 + 4on -h o 2 variables g$. 

Definition 5.25 The non-negative variable s pq describes the flow from PoP 
p to PoP q. These variables are called solution variables. 

There are r 2 variables of this type. 
5.3 Constraints 

We now list the constraints used in our model. The names of the constraints 
are taken from the "Risk Wise II Constraint Model" document. 

Constraint 5.1 (interface_error_correction a) The corrected traffic vol- 
ume value is equal to the observed value plus the error correction applied. 

V i <E Nodes, 1 < a < h { : ?> = a£ + e£ 

There are 2rc + o constraints of this type. 

Constraint 5.2 (interface_error_correction b) The corrected traffic vol- 
ume value is equal to the observed value plus the error correction applied. 

V i e Nodes, l<a<hii c% 1 = a£ l + e% 1 

There are 2n + o constraints of this type. 

Constraint 5.3 (interface_error.correction c) The corrected traffic vol- 
ume value on a line is equal to the observed value plus the error correction 
applied. 

V kle Lines: hi = </ urce + v% UTCe 
There are m constraints of this type. 
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Constraint 5.4 (interface-error_correction d) The corrected traffic vol- 
ume value on a line is equal to the observed value plus the error correction 
applied. 

V kl e Lines : t kl = u^ si + v$ 3t 
There are m constraints of this type. 

Constraint 5.5 (no decoct et^sum) The sum of all corrected traffic in to 
a node is equal to the sum of all corrected traffic out of the node. 

V i€ Nodes: £ e£ + £ t„ = £ g£*+ £ t a 

l<o</i< fct^Lines l<a</i» U^Lines 

There are n constraints of this type. 

Constraint 5.6 (traffic-.contribution_equal) The constraint states that 
the sum of all flows through a line is equal to the traffic on the line. 

y kl e Lines : ]T] r$j * fa = i kl 
We have m constraints of this type. 

Constraint 5.7 (solutionJ;erm(l)) The constraint states that the flow be- 
tween two PoPs is equal to the sum of all flows between nodes which belong 
to the first and second PoP. 

V p, q <= Pops : s Pg = Yl fa 
We have r 2 constraints of this type. 

Constraint 5.8 (flowJLg_suni) The total flow between two nodes is equal 
to the sum of the flows between the interface groups at the nodes. 

V i G Nodes J e Nodes : f {j = £ $ 

l<b<hj 

There are n 2 constraints of this type. 

Constraint 5-9 (flow_g_limit a) The sum of all flows from an interface 
group at a node is equal to the consistent traffic into this group. 

V ieNodes % \ <a<h { : c£ = £ . $ 

j€ Nodes 
l<b<hj 
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There are 2n + o constraints of this type. 

Constraint 5.10 (flow^gJimit b) The sum of all flows to some interface 
group at a node is equal to the consistent traffic out of this group. 

V j €Nodes,l<b<hj: 2$*= £ g# 

i£ Nodes 
1 <<•</»{ 

There are 2n + o constraints of this type. 

Constraint 5.11 (empty Jow) There are no flows between interconnec- 
tion lines. 

V i € Nodes, j e.Nodes, 3 < a < h u 3 < b < h 5 : g?-=0 

There are o 2 constraints of this type. These constraints are solved by pre- 
processing, setting; some variable to zero. 

Constraint 5.12 (inverse-g -flow) The flow between nodes i and j is lim- 
ited by the flow in the inverse direction. 

V i e Nodes J e Nodes, 1 < a < h u 1 < b < hj : g$ <L*9j} 
There are 2n + o constraints of this form. 

Constraint 5.13 (self-flow .net) There is no flow within a network node. 

V ie Nets : fu = 0 

There are n n constraints of this form. These constraints are solved by pre- 
processing, setting some variable to zero. 

5.4 Objective Functions 

The error correction model first performs an optimization to find a minimal 
error correction which provides us with a consistent data set to perform the 
flow analysis. The objective function for the error correction is 

min &x* £ |e£| + 6x* £ |cff'| + (5) 

i€Nodes izNodes 

6 2 * E Kr rce \ + b2* E («> 

kleLines ktGLines 

bz* E % + E 3i ut W 
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(8) 



(9) 



(10) 



As result of the flow analysis we want to have lower and upper bounds 
for all solution variables. This implies the objective functions 

min s pg 

and 

max s pq 

which are run independently for all pairs of PoPs p and q. This means that 
we have to run 2 * r 2 optimization queries. We can reduce this value by 
checking if a particular solution variable s pq has a value 0 in one of the other 
optimization runs. If that happens, we do not have to check it for minimum 
value. 



&4* £ g$ + 



\<a<2 
Kb<2 



h* J2 9$ + 



j£No<Us 
l<*<hi 
Z<b<hj 



6s* £ 



Nodes 
3<o<A i 
l<b<2 



# 
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5 . 5 Use of Variables / Constants 

The following table shows the use of the different variable and constant types 
in the constraints and the objective functions. 
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Optimizer Model 

In the following we describe the constraint model for the traffic flow optimizer 
according to one embodiment of the present invention. The model is based on the 
full constraint model including error correction. 

For the optimizer model we always study two networks scenarios. One is the 
original scenario, in which we have made traffic measurements, the other is a 
modified one. The modifications can be an arbitrary number of additions, 
changes or deletions from the original network topology or the network topology. 
For the modified scenario we want to calculate certain indicator values, which tell 
us about the impact of the modification. In the following formalism, all symbols 

related to the modified scenario are marked with a bar, like so: f u . 
6.1 Constants 

We start by defining the constants that we use to describe the modified 
network, its routing and the traffic loads in the network. We only define 
those items that we will need in the model. 



Definition 6.1 Nodes is the set of all nodes in the network. The number 
of nodes is denoted by n. Indices i, j, k, k X} I and l x refer to nodes. 

Definition 6.2 Routers is the set of all routers in the network. The num- 
ber of nodes is denoted by n r . 

Definition 6.3 Nets is the set of all network nodes in the network. The 
number of nodes is denoted by n n . 

Lemma 6.1 We have 



Nodes = Routers U Nets 

and 

Routers n Nets = 0 

Definition 6.4 Lines is the set of all node pairs kl so that a directed line 
exists between nodes k and I in the network. The number of lines in the 
network is denoted by m. 

Definition 6.5 The constant R k i is the set of all node pairs ij so that the 
flow from node i to node j is (partially) routed through line kl. 

Definition 6.6 The constant rjj 1 is a number between 0 and 1 which de- 
scribes which fraction of the flow between nodes i andj is routed through line 
kl. A value 0 indicates that the flow is not routed through line kl. 
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Definition 6.7 The constant bw k i describes the bandwidth of the line kl. 

Definition 6.8 The constant ncy is the routing cost of a flow from node 
i to node j. The value indicates how much network resources are used to 
transport the flow from node i to node j. 

Lemma 6.2 

V ij € Nodes : nc 0 - = ^3 f ij 

kl^ Lines 

Lemma 6.3 

V i <E Nodes : nca = 0 

V iyj € Nodes, j : ndj > 1 

Observation: Normally, the value ncij is an integer, but this is not always 
the case. 

Definition 6.9 The constant wy is the factor of increase of the flow from 
node i to node j from the original network to the modified network. 

Definition 6.10 The constant zy is an additional flow from node i to node 
j in the modified network. 

Definition 6.11 The constant JJ describes the utilization limit for all lines 
in the network. 

Definition 6.12 The constant L is the set of all node pairs ij so that the 
flow from i to j can not be routed in the modified network. 

6.2 Variables 

Definition 6.13 The non-negative variable t k! is the consistent traffic vol- 
ume on the directed line between nodes k and I. 

There are m variables of this type. They are not explicitly represented 
in the model. 

Definition 6.14 The non-negative variable fy describes the flow from node 
i to node j. These variables are called flow variables. 

There are n 2 variables of this type. They are not explicitly represented 
in the model. 
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Definition 6,15 The non-negative variable d^i is the utilization (in percent) 
of the directed line between nodes k and L 

There are fh variables of this type. 

Definition 6.16 The non-negative variable D is the overall utilization (in 
percent) of the modified network. 

There is only one variable of this type. 

Definition 6.17 The non-negative variable d min is the minimal utilization 
of any line in the network. 

There is only one variable of this type. 

Definition 6.18 The non-negative variable d max is the maximal utilization 
of any line in the network. 

There is only one variable of this type. 

Definition 6.19 The non-negative variable 5ki describes the percentage of 
overloading of line ki If the line is not overloaded, the value is 0. 

There are fh variables of this type. 

Definition 6.20 The non-negative variable h^i describes the volume of over- 
loaded traffic on line ki If the line is not overloaded, the value is 0. 

There are fh variables of this type. 

Definition 6.21 The non-negative variable O describes the total amount of 
overloading in the network. 

There is only one variable of this type. 

Definition 6.22 The non-negative variable H describes the total volume of 
overloaded traffic in the network. 

There is only one variable of this type. 

Definition 6.23 The non-negative variable Lost describes the volume of 
traffic in the original network that can not be routed in the modified network. 
If all traffic can be routed, the value is 0. 

There is only one variable of this type. 




6-3 Constraints 

Constraint 6.1 (traffic_contribution_equal) The constraint states that 
the sum of all flows through a line is equal to the traffic on the line. 

V kl € Lines : ]T f ij * = * kl 

We have m constraints of this type. They are not explicitly represented in 
the model. 

Constraint 6.2 The constraint states that the utilization is the, traffic di- 
vided by the bandwidth. 

V kl € Lvnes : d kl = 100 * 7^- 

bw k t 

The constraints are not explicitly represented in the model. 

Constraint 6.3 The overall utilization is calculated by the total traffic di- 
vided by the sum of all bandwidth in the network. 

g hi 

5 = 100* kl ^ in '° 

2L, bw ki 

kl£ Lines 

The constraint is not explicitly represented in the model. 

Constraint 6.4 (flow-assumption) The constraint states how flows in the 
original and in the modified network are related. 

V i,j€ Nodes : fij = w i5 * /y + z {j 

The constraints are not explicitly represented in the model. 

Constraint 6.5 The lost traffic is the sum of all flows of the original net- 
work that can not be routed in the modified network. 

Lost = h 

Constraint 6.6 The amount of overload for a line is the percentage of uti- nonlinear 
lization exceeding the utilization limit. 

V kl € Lines : d kl = max(0, d kl - U) 
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The constraints are not explicitly represented in the model. 

Constraint 6.7 The total amount of overload is the sum of the overload on nonlinear 
each line. 

o= E °«i 

kl£Lines 

The constraint not explicitly represented in the model. 

Constraint 6.8 The volume of overloading of a line is the volume of the nonlinear 
traffic exceeding the utilization limit 

V kle Lines : h kt = max(0, t k i ~ — i^*' ) 
The constraints are not explicitly represented in the model. 

Constraint 6.9 The total volume of overloading is equal to the sum of the nonlinear 
overloaded volumes on each line. 

klGLines 

The constraint is not explicitly represented in the model. 

Constraint 6.10 The minimal utilization is the minimum of all line utiliza- nonlinear 
tions in the network. 

dmin = min {d k i\ 

klQ Lines 

The constraint is not explicitly represented in the model. 

Constraint 6.11 The maximal utilization is the maximum of all line uti- nonlinear 
lizations in the network. 

dmax = max {dki\ 

kl^ Lines 

The constraint is not explicitly represented in the model. 
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6.4 Approximations 

Constraints 6.6 to 6.11 are non-linear. Linear approximations can be denned 
as follows. Variables defined through the linear approximation are marked 
with '. We typically will be able to calculate such approximations more easily 
in our program. 

Constraint 6.12 replaces constraint 6.6 

V kl 6 Lines : d' kl > d kl -U (U) 
o' kt > 0 (12) 

Constraint 6.13 replaces constraint 6.7 

0'= g o' kl 

kleLines 

Constraint 6.14 replaces constraint 6.8 

- U * bwki /iqn 

V ki€ Lines : h! kl > t kl ^oq~ 1 } 

h' kl > 0 " (14) 

Constraint 6.15 replaces constraint 6.9 

kleLines 

Constraint 6.16 replaces constraint 6.10 

V Lines : <? min < 
Constraint 6.17 replaces constraint 6.11 

V A:Z € Lines : d^ax ^ ^« 

6.5 General Results 

Theorem 6.1 The utilization of a line in the modified network can be ex- 
pressed as a linear function of node to node flows in the original network. 
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Proof: 

V kl € Tmes : d kt = 100 * 7^- ' (15) 

bw k i 

100 t_ _ kl - , x 

= ST" E r «-^ ( 16 

= ^ E f^Wo + zy) (17) 
_ 100*f*V + ■ 

- ..^ K + E — 65^ — fij{18) 

yeRki _ fi6fi w > l^f , 

~ " constant 
constant 

Theorem 6.2 27ie overall network utilization in the modified network can be 
expressed as a linear function of node to node flows in the original network. 



Proof: 



g hi 

D = 100 * *'ff;' ne '. (19) 

g frwfcf ' ' 

fc/e Lines 

100 E E fg/« (20) 



kl^ Lines 

100 



«6itne« »J Sfiu 



v s E E *tf + %) (21) 

Z±_ UW M kleLinesijZRki 
kle Lines 

kl^Lines 

100 ^ , 

kle Lines 

V 10 ° 6 - E_ g_^i (23) 

kl^ Lines 

4±_ klefAncsijeNodea 
kleLines 
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100 



v E E_^ M 



hi £ Lines 



E 



E ^fci 

E ^i^i ( 25) 



kl£ Lines 

100 



ij 6 Nodes 



kl^Lines 



constant 



+ 4i_ v * fii 



kle Lines 



constant 



Lemma 6.4 



5Z *«= 21 ncij+fij 

kl e Lines i J€ iVodes 



Intuitively, the lemma states that the traffic in the network is completely 
explained in terms of the node to node flows, and that the contribution of a 
flow to the overall traffic is its volume multiplied by the routing cost of the 
flow. 

Theorem 6.3 Assume the same topology and routing in the original and the 
modified network. If the flows in the modified network are multiples of the 
original flows by the same factor, then we will forecast a growth of utilization 
on each line and of the overall u tilizatio n by the sa me fac tor. 

Formally, assume Nodes = Nodes, Lines = Lines, bw kl = bw kl and 
r ki _ f ki F ur ther assume fa = x * Then we have that d ki = x * d kl and 
D = x*£>. 

Proof: Under the given assumptions, we have that 

D = T. 100 * x * fa (26) 
ij&nzrs E bw " 

kle Lines 

g ncjj * fa 

. = g »iro* w ";g «- (27) 

kl€ Lines 



I 



60 



= 100*1^^ (28) 

kleLinea 

E **< 

= x*100* fc '|f;'" ea (29) 

kleLines 

= x*D (30) 

Similarity, we have that 

V kl e Lines : d kl = 100 * (31) 

bw ki 

= 100 (32) 

E rtj + x + fij 
= 100*^^— (33) 

EH ! •/« 

. = x*l_00*.^^ - (34) 

= x*100*T^- (35) 
= x * dju (36) 

Corollary 6.1 .Assume £fte same topology, routing and flows in the original 
and in the modified network. Then the forecasted line utilization will be equal 
to the corrected utilization in the original network. 

Theorem 6.4 Assume the same flows in the original and the modified net- 
work. IfnCij < ncij for all flows f then D < D. 

6.6 Approximation Results 

We now present some results on the linear approximations defined above. 
We typically can use them to find either the minimum or maximum value of 
the approximated value, but not to find both. 

Theorem 6.5 We can use the linear approximation d f kl to find the minimum 
value for the variable o*|. 



V kl € Lines : min 5m = min o' kl 

LP 
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The maximum value of o' kl is unbounded and therefore useless. Butwe can 
get the maximal value directly from 

Theorem 6.6 

V kl e Lines : max o kl = max(0, max d' kl - U) 

Theorem 6.7 We can use the linear approximation O' to find the minimum 
value for the variable O. 

minO = minO' 

LP 

The maximum value of O* is unbounded and therefore useless. 

Theorem 6.8 We can find an upper bound for the maximum value of O by 
the sum of the maxima of the o k i values. 

maxO < maxou 

klSLines 

The maximum value for d kl can be determined if we know the maximum 
value for each d k i. 

Theorem 6.9 

max V (dki -U)< maxO 

LP 

kl£ Lines 

This is a fairly trivial bound, and typically will be rather weak. We can 
strengthen the bound by restricting the sum to those lines that can be over- 
loaded in the worst case. 
Theorem 6.10 

max Y, fat-V) <max6 
LP fcls.t.a fc i>o 

This again assumes that we know the maximum value for each of the d k t 
variables. 

We can define similar results for the other approximations: 
Theorem 6.11 

mindmax = rnind^ ax 
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Theorem 6.12 



Theorem 6.13 



Theorem 6.14 



Theorem 6.15 



Theorem 6.16 



Theorem 6.17 



Theorem 6.18 



Theorem 6.19 



max d max = max (max ok/ 1 

kHz Lines LP 



max d min = max d' min 



max d min = min {min J w } 

kle Lines LP 



V fc/ e Lines : min ft w = min h' kl 

LP *' 



min H = min 5' 

LP 



max if < moixhki 

*=± LP 

kl^ Lines 



(*jk£ — - — ) < max H 

«r ^ _ V 100 ^ - 

kl^Lines 



max (*w 7H77-) < max # 

The following tables summarize the approximation results. For each 
value, they state how many LP queries must be processed. A value m indi- 
cates that all line utilizations have to be computed. As a reminder, we also 
give a brief statement of the meaning for each indicator. 
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Variable(s) 


LP queries 


Meaning 


mini? 


1 


the minimum overall utilization of the net- 
work for any flow distribution 


1 max D 


1 


the maximum overall utilization of the net- 
work for any flow distribution 


minO 


1 


minimum total amount of overloading in the 
best case 


maxO 


m + 1 


approximation only bounds for the largest 
percentage of overloading in the worst case 
scenario 


| min if 


1 


minimum total volume of overloading in the 
Dest case 


| max if 


m + 1 


approximation only bounds for the largest 
volume of overloading in the worst case sce- 
nario 


1 min Lost 


1 


the minimum volume of all flows of the orig- 
inal network that can not be routed in the 
modified one 


1 max Lost 


1 


the maximum volume of all flows of the orig- 
inal network that can not be routed in the 
modified one 


L 


none 


the flows of the original network that can 
not be routed in the modified one 


1 min d m ax 


1 


the maximum utilization of any line under 
the best assumptions about the flow distri- 
bution 


[ max d m ax 


m 


the maximum utilization of any line for the 
worst case scenario 


1 min d m in 


m 


the minimum utilization of any line in the 
worst case scenario 


1 max Gf m in 


1 


the minimum utilization of any line under 
the best assumptions about flow distribtion 
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Vanable(s) 


LP queries 


Meaning 


min tki 


m 


minimum traffic volume for each line 


max tki 


m 


maximum traffic volume for each line 


min d k i 


m 


minimum utilization of each line (follows 
from t kl ) 


max c?jti 


m 


maximum utilization of each line (follows 
from tki) 


min df-i 


m 


minimum amount of overloading for each 
line (follows from i kl ) 


max 


m 


maximum amount of overloading for each . 
line (follows from t k i) 


min hki 


m 


minimum volume of overloading for each line 
(follows from iki) 


maxhki 


m 


maximum volume of overloading for each 
line (follows from i k i) 



6.7 A Simpler Approximation 

If we are satisfied with lower and/or upper bounds on the values for t kl , we 
_can use the following approximation... The traffic on a line kl in the modified 
network can be expressed as 

i,j€Nodes 

Assuming that the set of nodes and the flows between them have not changed, 
we have that 

*« = E 

ijeNodea 

On the other hand, the traffic on a line k'V in the original network can be 
expressed as 

*r- E rf/* fij 

iJeNodes 

If for all nodes i and j we have that 

V ije Nodes: f£^r£'' 
we can bound with t^i*. 

_ < 
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With less justification, we could use the approximation 

^ source 

to find a simple a priori bound which only requires routing and some lookup 
routine, but does not require an LP run. 

This method of approximation can be extended to sums of traffic on mul- 
tiple lines in the original network, which can be used to bound the traffic on 
a line in the modified network. This may be useful if we are only interested in 
whether some line is overloaded or not, but do not care for actual utilization 
numbers. 

6.8 Forecast 

We now look at the different types of queries we encounter in the different 
optimizer modules. For each particular form of query, we note which variables 
can be used to answer it. 

Given a topology, traffic data and a flow growth pattern. 

Possible queries to answer are: 



6.8-1 What is the utilization of each line? 



Variable (s) 


Comment 


min / max d k \ 


needs 2m queries 



6.8.2 Are there any lines that are overloaded? 



Vaxiable(s) 


Comment 


O>0? 


needs 2m queries ? 



6.8.3 What is the total utilization of the network? 



Variable(s) 


Comment 


min / max D 


2 queries 



6.9 Resilience 

Given a topology, traffic data, a flow growth pattern and a set of network 
changes; Possible queries to answer are: 
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6.9.1 For each change, what is the utilization of each line and 
which flows are lost? What is the volume of each flow lost? 



Variable (s) 


Comment 


m.in / max dki 


needs 2m queries per change 


L 


obtained from routing 


V %j g L : min / max fij 


loss of precison probable; 2\L\ queries 
per change 



6.9.2 For each change, what is the utilization of each line and 
which flows are lost? What is the total volume of the flows 
lost? 



Variable (s) 


Comment 


min / max dki 


needs 2m queries per change 


L 


obtained from routing 


min / max Lost 


2 queries per change 



6.9.3 For each change, what is the utilization of each line and 
which flows are lost? 



-Variable(s) 


Comment 


min / max d ki 


needs 2m queries per change 


L 


obtained from routing 



6.9.4 For each change, are there overloaded lines and which flows 
are lost? 



Variable (s) 


Comment 


min / max O 


needs 2m queries per change 


L 


obtained from routing 



6.9.5 For each change, what is the overall utilization of the net- 
work? 



Variable(s) 


Comment 


min/ max D 


2 queries per change 


6.9.6 Is there a change so that some line is overloaded? 


Variable (s) 


Comment 


max{maxO} 


m + 1 queries per change 
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6.9.7 Is there a change so that some flow is lost? 



Variable(s) 


Comment 




obtained from routing 



6.9.8 What is the worst case utilization of the network for any of 



the changes? 



Variable(s) 


Comment 


max{maxD} 


2 queries per change 



6.9.9 What is the largest amount of traffic lost for any of 
changes? 



Variable(s) 


Comment ] 


max{max Lost} 


2 queries per change 1 



6.10 Planner 

Given a topology, traffic data, a flow growth pattern and a set of possible 
network changes. Possible queries to answer are: 

6.10.1 For each change, what is the utilization of each line? 



Variable(s) 


Comment 


min / max dki 


needs 2m queries per change 



6.10.2 For each change, which lines are overloaded by how much? 



Variable (s) 


Comment 


j min / max dki 


needs 2m queries per change 



6.10.3 For each change, are there overloaded lines? 



Variable(s) 


Comment 


O >0? 


needs 2m queries per change ? 



6.10.4 Which change results in the smallest amount of overload- 



ing? 



Variable (s) 


Comment 


min {max O] 


needs 2m queries per change ? 




6,10.5 Which change results in the smallest overall utilization? 



Variable(s) 


Comment 


min{£>} 


2 queries per change 



6.10.6 Which is the subset of n changes which reduces overloading 
the most? 

6.10.7 Which n changes to select one at a time to reduce over- 
loading? 

6,11 Resilient Planner 

Given a topology, traffic data, a flow growth pattern, a set of possible network 
failures and a set of possible network changes. Possible queries to answer are: 

6.11.1 For each change, is the resulting network resilient and what 
is the utilization of each line? 



6.11.2 For each change, is the resulting network partially resilient 
and what is the utilization of each line? 



6.11.3 Which change that does not worsen resilience results in 
the smallest amount of overloading? 



6.11.4 Which change that does not worsen resilience results in 
the smallest overall utilization? 



Partially resilience is defined compared to an existing situation. A net- 
work change is partially resilient compared to an original network if no ad- 
ditional line becomes overloaded and no additional traffic is lost. 
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A Scalable Version of the Constraint Model 

In a further embodiment of the present invention, we describe a traffic flow 
constrS model using additional approximations. In this way the model is 
especially useful for large networks. 

The models represented so far all have the disadvantage that we need 0(n Vode- 
to-node flow variables. This is a basic limitation that restricts the number of 
networics foT which the models can be used. We now present a way of relaxing 
Ae model iso L we need fewer variables. The key idea is to introduce flow 
variablesfor only some of the node-to-node flows and to approximate the 
re^ainLg flows. This will lead to weaker results, but it can be scaled to larger 
size by selecting fewer and fewer detailed flows. 

Changes against the minimal model are marked with an arrow in the margin. 



B.l Constants 

We start by denning the constants that we use to describe the network, its 
routing and the traffic loads in the network. 

Definition B.l Nodes is the set of all nodes in the network. The number 
of nodes is denoted by n. Indices i, j, k, k x , I and l x refer to nodes. 

Definition B.2 Pops is the set of all PoPs in the network The number of 
PoPs is denoted by r. Indices p and q refer to PoPs. Each PoP « a set of 
nodes, and each node belongs to exactly one PoP. 

Definition B.3 The function pop : i p maps the node index i to a PoP 
index p. 

Definition B.4 Lines is the set of all node pairs kl so that a directed line 
exists between nodes k and I in the network. The number of hnes m the 
network is denoted by m. 

Definition B.5 The constant t kl is the consistent traffic volume on the 
directed line between from node k to node I 

Definition B.6 The constant c[ n is the sum of all consistent traffic volumes 
for external traffic into node i. 

Definition B.7 The constant c? ut is the sum of all consistent traffic volumes 
for external traffic out of node i. 
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Definition B.8 The constant R k i is the set of all node pairs ij so that the 
flow from node i to node j is (partially) routed through line kl. 

Definition B.9 The constant rH 1 is a number between 0 and 1 which de- 
scribes which fraction of the flow from node i to node j is routed through line 
kl. A value 0 indicates that the flow is not routed through line kL 

Definition B.10 The constant pg q is either 0 or 1 and indicates whether 
the flow from node i to node j is a flow from PoP p to PoP q, i.e. i <Zp and 

3 tq- 

Lemma B.l We have that 

V i, j e Nodes : £ p?? = 1 

pGPops 
qGPops 

holds for all nodes i and j. 

Definition B.ll The constant Taken is a set of node pairs which represents 
the set of all selected node-to-node flows. We have that Taken C Nodes x 
Nodes. Entries in Taken are represented by. node. pairs ij 

Definition B.12 The constant Find is a set of PoP pairs which represents 
the set of all selected PoP-to-PoP flows. We have that Find C Pops x Pops. 
Entries in Taken are represented by PoP pairs pq. 

B.2 Variables 

We now describe the variables of our model. The first set, the flow variables, 
are used to define flows in the network consistently. It is the second set, the 
solution variables, that we are really interested in. These solution variables 
are defined as sums of flow variables. 

Definition B.13 The non-negative variable f u describes the flow from node 
i to node j with ij € Taken. These variables are called flow variables. 

There are \Taken\ variables of this type. 

Definition B,14 The non-negative variable restf ut describes the sum of all 
node-to-node flows starting in node i which are not in the Taken set. 



There are n variables of this type. 
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Definition B.15 The non-negative variable restj n describes the sum of all 
node-to-node flows ending in node j which are not in the Taken set. 

There are n variables of this type. 

Definition Ba6 The non-negative variable s pq describes the flow from PoP 
p to PoP q with pq € Find. These variables are called solution variables. 

There are \Find\ variables of this type. 
B.3 Constraints 

We now list the constraints used in our scalable model: The names of the 
constraints are taken from the "RiskWise II Constraint Model" document, 
but the constraints here are only approximations of the original constraints. 

Constraint B.l (flow_external(l)) The constraint states that the sum of 
all flows starting in a node is equal to the sum of all external traffic entering 

the node. _ in 

V ie Nodes: £ /« + restf = c? 

We have n constraints of this type. 

Constraint B.2 The constraint states that the variable rest? 1 of all non- 
taken flows starting in a node i is 0, if all flows starting in the node are m 
the Taken set. 

restf u = 0 if Vj € Nodes : ij e Taken 

The number of constraints of this type depends on the structure of the Taken 
set. There are at most n constraints of this type. 

Constraint B.3 (flow_external(l)) The constraint states that the sum of 
all flows ending in a node is equal to the sum of all external traffic leamng 

the node. . , 

V je Nodes : £ f ij + restf = c j 

ie Nodes 



We have n constraints of this type. 
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Constraint B.4 The constraint states that the variable rest*/ 1 of all non- 
taken ft ows ending in a node j is 0, if all flows ending in the node are in the 
Taken set 

res*j n = 0 if Vie Nodes: . ij m e Taken 

The number of constraints of this type depends on the structure of the Taken 
set. There are at most n constraints of this type. 

Constraint B.5 (traffic-contribution.equal) The constraint states that 
the sum of all flows through a line can be used to approximate the traffic on 
the line. 

V kl € Lines : £ rg * fa + £ rest f* < tkl 

i&laktn {iftijeNodesi ijeTakenVijsR kl } 

V kl e Lines : £ r# * fa + £ rest}" < *„ 

V kle Lines: t kl < £ r% * fa + £ restf" 

iiefwn {i|3ij€ii fc ,s.t.£i§eraA:en} 

V Aii € Lines : t« < £ rg * ^ + £ restf 

i/er^Kn {i|3ii€H W s.t.ijfSroA ! cn} 

We have m constraints of this type. 

Constraint B.6 (solution^term(l)) The constraint states that the flow 
between two PoPs is related to the sum of all flows between nodes which 
belong to the first and second PoP. We restrict ourselves to flows that are in 
the Find set of interesting flows, 

V pqe Find : £ fa < Spg 

»6p 

;€<? 

ijGTaken 

V pqe Find: s pq < £ /, j + £ res*?"' 

j|JJ {i|3x€p s.t. if gTaAren} 

ij'6TaJccn 

We have |F£nd| constraints of this type. 




B.4 Objective Functions 

As result of the flow analysis we want to have lower and upper bounds for 
all solution variables. This implies the objective functions 



min s pg 

and 



max Spg 

which are run independently for all pairs of PoPs pq in Find. This means 
that we have to run 2 * \Find\ optimization queries. We can reduce this 
value by checking if a particular solution variable s M has a value C > in j>ne of 
the other optimization runs. If that happens, we do not have to check it for 
minimum value. 

B.5 Complexity 

The model presented has \Taken\ + 2n + \Find\ variables and at most in + 
4m + 2 * \Find\ constraints. For a flow analysis we have to run at most 

2 * \Find\ queries. . . , , , 

On the other hand note that we still need to know the r% values, and for 
this we need to calculate all node to node routes in the network. 



B.6 Optimizer 

In analogy to the constraints above we can relax the definitions and constraints 
regarding the optimizer model. 



B .7 Choosing Taken and Find sets 

The above mode depends on a wise choice of the Taken and Find sets in order, to 
reduce the loss of accuracy of the model. The a priori bounds for flows 
mentioned below help with this. 
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A Priori Flow Bounds 

All flow variables have a lower bound of 0. An upper bound for each flow 
can be easily computed as 

Vi, j € Nodes : fij < t kt if ij g R kl 

A similar bound is found on the external traffic: 

Nodes : Jij < cf 

V«,i€ Nodes : fij<cf l 

These bounds can be trivially extended to the solution variables The re- 
sulting values may be very weak. Sometimes, a stronger bound is found 
by 

s P q<t k i if Vi€.p,j€q: r *? = \ 

All these bounds are expressed on the corrected traffic variables. Using 
the uncorrected values instead may lead to misleading results. 
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Handling Unknown Traffic Values 

The error correction model assumes that for each line in the model we know the 
observed traffic volume. Unfortunately, this may not always be the case A 
tvoical example are interfaces which are connected to backbone lines, but for 
which we can not collect any traffic data, due to problems of the data collector or 
of the network device. In the models described above, we allowed the observed 
traffic to be a variable of the model, which would be left uninstantiated in certain 
situations. These variables would be assigned a value in the error correction, 
choosing a number which would cause minimal error correction in other parts ot 
the network. 

Alternatively, set the traffic of such unobserved variables to 0 and to allow a free 
error correction on these interfaces. This will have the same overall effect on the 
model, but is simpler and leads to a more consistent view of the problem. 
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Whilst in the above described embodiments linear objective functions 
are described, it is appreciated that alternatively quadratic objective functions 
can be used. 

Whilst in the above described embodiments, IP networks are 
5 described, it is appreciated that the invention may be implemented into other 
types of communication networks. 

i 

Whilst in the above described embodiments the use of flow rates are 
described, it is appreciated that alternatively the whole calculation could also 
be done directly in volume. 
10 Whilst in the above described embodiments the traffic data 

measurements are obtained using the SNMP, it is appreciated that 
alternatively other methods can be used. 

Whilst in the above described embodiments a specific set of traffic 
data measurements obtained from the router and router interfaces are 
15 described, it is appreciated that other measurements can be used instead or in 
addition to the measurements described. 

It is to be understood that the embodiments described above are 
preferred embodiments only. Namely, various features may be omitted, 
modified or substituted by equivalents without department from the scope of 
20 the present invention, which is defined in the accompanying claims. 



CLAIMS: 
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1. A method of calculating traffic values in a communications network, 
the communications network comprising a plurality of nodes, the nodes being 
5 connected to one another by links, the method comprising: 

(a) obtaining traffic data measurements through said nodes and/or links in 
an initial scenario as input data; 

(b) deriving a traffic flow model for a modified scenario using a plurality 
of constraints describing the interdependency of said initial to said 

10 modified scenario; and 

(c) calculating values and/or upper and lower bounds of traffic values for 
said modified scenario from said traffic flow model using said input 
data. 

15 2. A method according to claim 1, wherein said modified scenario 
comprises one or more of: a modified network topology, modified routing 
algorithm parameters, modified traffic engineering constraints and/or 
modified traffic load compared to the initial scenario. 



20 



3. A method according to claim 1 or 2, wherein said constraints are 
derived from the network topology and network behaviour of the initial 
network. 
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4. A method according to claim 1, 2 or 3, wherein step (b) further 
comprises performing a routing procedure in said modified scenario. 

5. A method according to any preceding claim, further comprising the 
5 step of verifying the consistency of the measured input data using information 

about the network topology and/or the network behaviour of the initial 
scenario. 

6. A method according to any preceding claim, wherein said input data 
10 are corrected if inconsistencies are detected. 

7. The method of claim 6, further comprising solving a linear 
progr amming- problem w ith a lin e ar o bjc c tiv c- f unctk m to minimise the da ta 

traffic reconciliation (error correction). 

15 

8. The method of claim 6, further comprising solving a linear 
programming problem with a non-linear objective function to minimise the 
data traffic reconciliation (error correction). 

20 9. A method according to any preceding claim, wherein in step (b) the 
traffic values in the modified scenario are expressed as a linear function of 
node-to-node flows in the initial scenario. 
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10. A method according to any preceding claim, wherein traffic values 
which are not affected by the modification from said initial to said modified 
scenario are equal to the corresponding input data or corrected input data of 
the initial scenario. 

5 

11. A method according to any preceding claim, wherein said traffic 
values comprise utilisation, overload ■and/or traffic volume values. 

12. A method according to any preceding claim, wherein said constraints 
10 comprise linear constraints. 

13. A method according to any preceding claim, wherein said constraints 
comprise non-linear constraints. 

15 14. A method according to claim 13, wherein a linear approximation to a 
non-linear constraint is used. 

15. A method according to any preceding claim, further comprising: 
selecting a first and a second node; 
20 solving a first linear programming problem by computing the upper 

bound of traffic flow values from the first to the second node; and 

solving a second linear programming problem by computing the lower 
bound of traffic flow values from the first to the second set of nodes. 
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16. The method of any preceding claim, further comprising repeating step 




(a) at different times and/or at periodic intervals. 

5 17. A method of calculating traffic values in a communications network, 
the communications network comprising a plurality of nodes, the nodes being 
connected to one another by links, the method comprising: 

(a) obtaining data traffic data measurements through said nodes 
and/or links in an initial scenario as input data; 
10 (b) considering a modified scenario; 

(c) defining one or more solution variables for said modified 
scenario; 

(d) dete miin i ng - con straints— between— traf fi c flow s llnuugh said 

links and nodes to describe the network topology and 

15 behaviour of the network; 

(e) deriving a traffic flow model using said input data and said 
relations for calculating said solution variables. 

18. A method according to claim 17, wherein step (b) further comprises 
20 performing a routing process for said modified scenario. 
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19. A method according to claim 17 or 18, wherein said solution variables 
can be expressed as a linear function of one or more node-to-node flows of 
said network. 

5 20. The method of claim 17, 18 or 19, wherein said constraints in step (b) 
include relations among data traffic rates based on the definition of network 
protocol (such as IP, TCP, UDP) which defines the network behaviour. 

21. A method according to any of claims 17 to 20, wherein said 
10 constraints comprising any of the following constraints: 

- routing-based constraints 

- link-based constraints 

- node-based constraints 

- error-based constraints. 

15 

22. The method of any of claims 17 to 21, wherein said constraints relate 
to any of the following: 

- the size of data packets used in the network; 

- relationship between the number of data packets and the data traffic 

20 volume; 

- constraints determined by the routing protocol used in the network; 

- the relationship between incoming and outgoing data traffic at said 
plurality of nodes; 
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- the relationship between the data traffic at both ends of each link; 

- the relationship between the data traffic along said routes and the data 
traffic input into and output from the network. 

5 23. A method according to any of claims 17 to 22, further comprising 
repeating stages (b), (c) and (d) for different modifications of said network. 

24. A method according to claim 23, further comprising calculating a 
minimal and a maximal value for each solution variable taking into account 
10 one or more of said different modifications. 

25: A method according to claim 23 of 247 further comprising calculating 
one consistent solution for all solution variables taking into account all said 
modifications. 

15 

26. An apparatus for calculating traffic values in a communications 
network, adapted to perform the method of any preceding claim. 

27. A network management system for managing a network comprising a 
20 plurality of nodes, the nodes being interconnected by links, the network 

management system comprising: 

means for measuring the data traffic input into and output from nodes 
and links and 
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the apparatus of claim 26. 

28. A computer program for performing the method of any of claims 1 to 
25 when operated in a computer system. 

29. A method of calculating data traffic substantially as described herein 
with reference to the accompanying drawings. 
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ABSTRACT 

A method of calculating traffic values in a communications network, the 
communications network comprising a plurality of nodes, the nodes being 
connected to one another by links, the method comprising: (a) obtaining 
traffic data measurements through said nodes and/or links in an initial 
scenario as input data; (b) deriving a traffic flow model for a modified 
scenario using a plurality of constraints describing the interdependency of 
said initial to said modified scenario; and (c) calculating values and/or upper 
and lower bounds of traffic values for said modified scenario from said traffic 
flow model using said input data. 
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